NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multiscale genesis of a tiny giant for percolation on scale-free random graphs

https://doi.org/10.1214/24-AOP1735

Bhamidi, Shankar; Dhara, Souvik; van_der_Hofstad, Remco (July 2025, The Annals of Probability)

Free, publicly-accessible full text available July 1, 2026
Scaling limits and universality: Critical percolation on weighted graphs converging to an 𝐿³ graphon

https://doi.org/10.1090/tran/9267

Baslingker, Jnaneshwar; Bhamidi, Shankar; Broutin, Nicolas; Sen, Sanchayan; Wang, Xuan (February 2025, Transactions of the American Mathematical Society)

We develop a general universality technique for establishing metric scaling limits of critical random discrete structures exhibiting mean-field behavior that requires four ingredients: (i) from the barely subcritical regime to the critical window, components merge approximately like the multiplicative coalescent, (ii) asymptotics of the susceptibility functions are the same as that of the Erdős-Rényi random graph, (iii) asymptotic negligibility of the maximal component size and the diameter in the barely subcritical regime, and (iv) macroscopic averaging of distances between vertices in the barely subcritical regime. As an application of the general universality theorem, we establish, under some regularity conditions, the critical percolation scaling limit of graphs that converge, in a suitable topology, to an $L^{3}$ graphon. In particular, we define a notion of the critical window in this setting. The $L^{3}$ assumption ensures that the model is in the Erdős-Rényi universality class and that the scaling limit is Brownian. Our results do not assume any specific functional form for the graphon. As a consequence of our results on graphons, we obtain the metric scaling limit for Aldous-Pittel’s RGIV model inside the critical window (see D.J. Aldous and B. Pittel [Random Structures Algorithms 17 (2000), pp. 79–102]). Our universality principle has applications in a number of other problems including in the study of noise sensitivity of critical random graphs (see E. Lubetzky and Y. Peled [Israel J. Math. 252 (2022), pp. 187–214]). In Bhamidi et al. [Scaling limits and universality II: geometry of maximal components in dynamic random graph models in the critical regime, In preparation], we use our universality theorem to establish the metric scaling limit of critical bounded size rules. Our method should yield the critical metric scaling limit of Ruciński and Wormald’s random graph process with degree restrictions provided an additional technical condition about the barely subcritical behavior of this model can be proved (see A. Ruciński and N. C. Wormald [Combin. Probab. Comput. 1 (1992), pp. 169–180]).
more » « less
Free, publicly-accessible full text available February 18, 2026
Representation, ranking and bias of minorities in sampling attributed networks

https://doi.org/10.1007/s13278-024-01326-6

Antunes, Nelson; Banerjee, Sayan; Bhamidi, Shankar; Pipiras, Vladas (December 2024, Social Network Analysis and Mining)

Full Text Available
Co-evolving dynamic networks

https://doi.org/10.1007/s00440-024-01274-4

Banerjee, Sayan; Bhamidi, Shankar; Huang, Xiangying (June 2024, Probability Theory and Related Fields)

Full Text Available
Geometry of the minimal spanning tree in the heavy-tailed regime: new universality classes

https://doi.org/10.1007/s00440-024-01259-3

Bhamidi, Shankar; Sen, Sanchayan (April 2024, Probability Theory and Related Fields)

Full Text Available
The spatial distribution of coupling between tau and neurodegeneration in amyloid-β positive mild cognitive impairment

https://doi.org/10.1016/j.neurobiolaging.2024.01.014

Robinson, Belfin; Bhamidi, Shankar; Dayan, Eran (April 2024, Neurobiology of Aging)

Full Text Available
Statistical Significance of Clustering with Multidimensional Scaling

https://doi.org/10.1080/10618600.2023.2219708

Shen, Hui; Bhamidi, Shankar; Liu, Yufeng (January 2024, Journal of Computational and Graphical Statistics)

Clustering is a fundamental tool for exploratory data analysis. One central problem in clustering is deciding if the clusters discovered by clustering methods are reliable as opposed to being artifacts of natural sampling variation. Statistical significance of clustering (SigClust) is a recently developed cluster evaluation tool for high-dimension, low-sample size data. Despite its successful application to many scientific problems, there are cases where the original SigClust may not work well. Furthermore, for specific applications, researchers may not have access to the original data and only have the dissimilarity matrix. In this case, clustering is still a valuable exploratory tool, but the original SigClust is not applicable. To address these issues, we propose a new SigClust method using multidimensional scaling (MDS). The underlying idea behind MDS-based SigClust is that one can achieve low-dimensional representations of the original data via MDS using only the dissimilarity matrix and then apply SigClust on the low-dimensional MDS space. The proposed MDS-based SigClust can circumvent the challenge of parameter estimation of the original method in high-dimensional spaces while keeping the essential clustering structure in the MDS space. Both simulations and real data applications demonstrate that the proposed method works remarkably well for assessing the statistical significance of clustering. Supplementary materials for this article are available online.
more » « less
Full Text Available
Learning attribute and homophily measures through random walks

https://doi.org/10.1007/s41109-023-00558-3

Antunes, Nelson; Banerjee, Sayan; Bhamidi, Shankar; Pipiras, Vladas (December 2023, Applied Network Science)

Abstract We investigate the statistical learning of nodal attribute functionals in homophily networks using random walks. Attributes can be discrete or continuous. A generalization of various existing canonical models, based on preferential attachment is studied (model class $$\mathscr {P}$$ P ), where new nodes form connections dependent on both their attribute values and popularity as measured by degree. An associated model class $$\mathscr {U}$$ U is described, which is amenable to theoretical analysis and gives access to asymptotics of a host of functionals of interest. Settings where asymptotics for model class $$\mathscr {U}$$ U transfer over to model class $$\mathscr {P}$$ P through the phenomenon of resolvability are analyzed. For the statistical learning, we consider several canonical attribute agnostic sampling schemes such as Metropolis-Hasting random walk, versions of node2vec (Grover and Leskovec, 2016) that incorporate both classical random walk and non-backtracking propensities and propose new variants which use attribute information in addition to topological information to explore the network. Estimators for learning the attribute distribution, degree distribution for an attribute type and homophily measures are proposed. The performance of such statistical learning framework is studied on both synthetic networks (model class $$\mathscr {P}$$ P ) and real world systems, and its dependence on the network topology, degree of homophily or absence thereof, (un)balanced attributes, is assessed.
more » « less
Full Text Available
Improved baselines for causal structure learning on interventional data

https://doi.org/10.1007/s11222-023-10257-9

Richter, Robin; Bhamidi, Shankar; Mukherjee, Sach (October 2023, Statistics and Computing)

Abstract Causal structure learning (CSL) refers to the estimation of causal graphs from data. Causal versions of tools such as ROC curves play a prominent role in empirical assessment of CSL methods and performance is often compared with “random” baselines (such as the diagonal in an ROC analysis). However, such baselines do not take account of constraints arising from the graph context and hence may represent a “low bar”. In this paper, motivated by examples in systems biology, we focus on assessment of CSL methods for multivariate data where part of the graph structure is known via interventional experiments. For this setting, we put forward a new class of baselines called graph-based predictors (GBPs). In contrast to the “random” baseline, GBPs leverage the known graph structure, exploiting simple graph properties to provide improved baselines against which to compare CSL methods. We discuss GBPs in general and provide a detailed study in the context of transitively closed graphs, introducing two conceptually simple baselines for this setting, the observed in-degree predictor (OIP) and the transitivity assuming predictor (TAP). While the former is straightforward to compute, for the latter we propose several simulation strategies. Moreover, we study and compare the proposed predictors theoretically, including a result showing that the OIP outperforms in expectation the “random” baseline on a subclass of latent network models featuring positive correlation among edge probabilities. Using both simulated and real biological data, we show that the proposed GBPs outperform random baselines in practice, often substantially. Some GBPs even outperform standard CSL methods (whilst being computationally cheap in practice). Our results provide a new way to assess CSL methods for interventional data.
more » « less
Full Text Available
Fluctuation bounds for continuous time branching processes and evolution of growing trees with a change point

https://doi.org/10.1214/22-AAP1881

Banerjee, Sayan; Bhamidi, Shankar; Carmichael, Iain (August 2023, The Annals of Applied Probability)

Full Text Available

« Prev Next »

Search for: All records